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ABSTRACT 


Acoustic problem is a main issues of the existing classroom due to lack of 
absorption of surface material. Thus, a feed forward neural network system 
(FFNN) for classroom Reverberation Time (RT) estimation computation was 
built. This system was developed to assist the acoustic engineer and 
consultant to treat and reduce this matter. Data was collected and computed 


using ODEON12.10 ray tracing method, resulting in a total of 600 
rectangular shaped classroom models that were modeled with various length, 
width, height, as well as different surface material types. The system is able 
to estimate RT for 500Hz, 1000Hz, and 2000Hz. Using the collected data, 
FFNN for each frequency were trained and _ simulated separately 
Classroom (as absorption coefficients are frequency dependent) in order to find the 
Feed forward neural network optimum solution. The final system was validated and compared with the 
ODEON actual measurement value from 15 different classrooms in Universiti Tun 
Reverberation time Hussein Onn Malaysia (UTHM). The developed system show positive results 
with average validation accuracy of 94.35%, 95.91%, and 96.42% for 500Hz, 
1000Hz, and 2000Hz respectively. 
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1, INTRODUCTION 

Most of the existing classroom are designed for lecture-based education, meanwhile, the growth in 
education world toward Education 4.0, required the used of education tool and platform to become 
interactive. This finishes of existing classroom are fine for lecture-based education that may cause acoustic 
problem in the interactive education, and lead difficult to hear. Reverberation is one of the reasons of the 
acoustic problem which able to be treatable with the proper mix of absorption. While, surface material; 
ceilings, concrete walls, wooden walls, tile floors, and wooden doors, plays a critical role of lack of 
absorption that can creates an excessively reverberant room. 

The proposed system of Reverberation Time (RT) computation 1s able to assist the acoustic engineer 
and consultant to treat and properly locate the exact specifications of absorption material in order to reduce 
the classroom acoustic problem. RT is a time taken for the audio signal to drop by 60 dB. The first 
established formulae in estimating RT in an enclosed space was made by Sabine [1], as seen in (1). 
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With, 

V =room volume 

S = surface area 

a. = material absorption coefficient 

Considering the simplicity of the Sabine formulae on top of ignoring many other factors, a jarring 
error can be seen between the formulaic and the actual measurement RT value. Since then, many other 
researchers have made improvements and materialized with more solid and accurate formulae calculations in 
regard of RT [2-6]. Besides this fundamental calculations, other techniques and methods were also being 
presented such as the computer simulations ray-tracing techniques (ODEON) [7] and finite element models 
(FEM) [8-9]. As ODEON and FEM taking an extra time on preparations (designing room models, etc.) 
researchers are still seeking for other alternative methods in estimating RT for instance by using neural 
network (NN) [10]. 

Since 1999, researches on RT prediction using NN, were started by Nannarielo and Fricke [11], in 
order to examine the developed NN using dataset from actual large halls building. This group of researchers 
were successfully manage and prove that NN is useful in predicting room RT. In 2010, experiments to 
predict RT for classrooms using dataset gathered from FEM computation were reported by [12]. Research in 
predicting RT was continued by Aliabadi et al. in 2014 [13]. In this research they were able to strengthen the 
potential of NN method in minimizing the uncertainties in acoustics' modeling for industrial workrooms. 

The purpose of this research is to design an alternate method in estimating RT in a classroom that is 
cost effective and uncomplicated in addition to be able to provide users with an alternative prediction model 
with low percentage error. This research focuses on classroom RT computation in order to obtain the 
optimum sound between the teacher and students, by minimize the noise [14]. Moreover, poor working 
conditions in a classroom can be avoided as well as maintaining the teacher’s comfort in delivering the 
speech [15]. The ideal classroom RT value 1s in the range of 0.4 — 0.6 seconds, although most of the existing 
classrooms produced RT more than 1 second that can cause sounds confusion between teacher’s voice and 
it’s reflecting sounds [16]. Therefore, RT has become an important parameter in classroom architectural 
design in order to achieve and maintain the ideal RT value. Process flow of the proposed system was 
summarize as in Figure 1. 


Load data of surface material 
images and rectangular 
shaped classroom models 


Epoch > 1000 


Compute and simulate the 
RT output using ODEON Yes 
12.20 


Testing data and observe the 
accuracy and regression (R) 


Divide data into training and 
testing set 


Validation with actual room 
measurements 


Figure 1. Experimental process flow 
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2. DEVELOPMENT OF FEED FORWARD NEURAL NETWORK (FFNN) RT PREDICTION 
SYSTEM 

The neural network dataset was collected from rectangular shaped classroom models as seen in 
Figure 2. This classroom model was sketched using Google SketchUp with different in heights, widths, and 
lengths. Window was also added to some of the models. 

The RT output values from these classroom models were computed and simulated using ODEON 
12.10 ray-tracing simulation. Different materials were applied to the surface and the sound source was placed 
randomly for each model in the dataset. At the end, 600 data that consists of various models and surface 
materials were successfully gathered. 


SIS 





Figure 2. Samples of room models 


Frequency of 500Hz, 1000Hz and 2000Hz, were chosen in order to complete the train and analysis 
of FENN system. (2), (3), and (4) show RT for 500 Hz, 1000 Hz and 2000Hz, respectively. 13 variables were 
applied as FENN input features and requested to compute the RT. Input dataset geometrical characteristics 
for the FFNN training dataset was shown in [17]. 


RT500 = f(V, L, W, H, Saw1[500], Saw2[500], Safl[500], Sadoor[500], Sacei[500], Sawin[500], x/L, y/W, z/H) (2) 
RT1000 = f(V, L, W, H, Saw1[1000], Saw2[1000], Safl[1000], Sadoor[ 1000], Sacei[1000], Sawin[1000], x/L, y/W, z/H) (3) 
RT2000 = f(V, L, W, H, Saw1[2000], Saw2[2000], Safl[2000], Sadoor[2000] , Sacei[2000], Sawin[2000], x/L, y/W, z/H) (4) 


3. FEED FORWARD NEURAL NETWORK (FFNN) TRAINING PERFORMANCE 

In this experiment, 60%, 20%, and 20% data from the dataset is used as the training data, validation 
data and testing data, respectively. The training data is used to train and fit the models; the validation data is 
used to estimate the prediction error for the model selection, as well as prevent network from overfitting; the 
test set is used for the assessment of the generalization error of the final chosen model [18]. The test data 
should be unknown or new to the FFNN system. Figure 3 shows the regression plots for training, validation, 
and testing data for 5|00Hz, 1000Hz and 2000Hz, respectively. 

The optimum network for 500Hz, 1000Hz, and 2000Hz were lastly combined into one final system 
using Matlab GUI. Figure 4 shows the main page in the final system, where the users are required to fill up 
the values needed in the NN computation. 

Figure 5 shows the interfacing page of adding the material surface. The users have to insert the 
photographic surface image or select surface images from library as well as its dimension. Figure 6 shows the 
result page where the RT for 500Hz, 1000Hz, 2000Hz are displayed. 
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Figure 3. Regression value for (a). 500Hz, (b)1000Hz, and (c)2000Hz 








i] masn_rt ee ; 
Reverberation Time Estimation System using Neural Network 
No. Material Surface Dimension (2? ) 
Room dimensions 
| 10.84 x 10.172 288 1 Microacoustics Microperforated panel 110.26 
Length(m) = Width (m) Height (m) 2 Concrete block, painted 72.79 
3. Wood, 25 mm with air space 31.26 
fe 4. Marble or glazed tile 110.26 
Solid wooden door 3.55 
Width 
: Window (Glass) 13.49 
Sound source location (from point 0) —— 
Add surface 
2.36 x 486 x 1.37 
| x y z 





Calculate RT 











Figure 4. Main page of the system 
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Identify material 


Material type: Concrete block, painted 


1000Hz 20008 z 
Frequency 


Absorption coefficient: 





Reverberation Time (s): 





500Hz | 1000Hz | 2000Hz | 
0.0600 0.0700 0.0900 





500Hz 1000Hz 2000Hz 
0.7339 0.8754 0.9648 





Dimension : 72.79 mi? 





Figure 5. Interface page of adding Figure 6. Result page of the system material surface 


4. VALIDATION WITH ACTUAL ROOM MEASUREMENTS 

The finished system is then applied to the actual classrooms. 15 classrooms (CRO1 - CR15) in 
UTHM were sampled and real-time RT measurements were taken using the regular reverberation room 
method. Figure 7 shows an example of the actual RT measurement. For comparison, the sampled classrooms 
were also modeled and simulate using ODEON 12.10 as shown in Figure 8. The geometrical characteristic 
for these 15 classrooms are compiled in Table 1. 
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Figure 7. Example of actual room measurement using reverberation time method 








Figure 8. Samples of modeled actual classrooms using ODEON 12.10 
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Table 1. Room Geometrical Characteristics for NN Validation Dataset 


Max Min Mean Standard Deviation 
V (m’) S202 300.41 319.899 6.16 
H (m) 2.88 2.82 2.86 0.025 
L (m) 10.89 10.20 10.84 0.162 
W (m) 10.44 10.07 10.35 0.102 
SOw1/500] 4.68 2.53 4.37 ().94 
SOw2/500] 5.63 2.76 2.82 1.37 
SOsoo] 1.14 1.07 1.12 0.02 
SOdoor{ 1000] 0.21 0.21 0.21 0.00 
SQcei[ 1000] 112.41 94.27 110.57 5.66 
SOwin[ 1000] 24S 1.42 1.94 0.39 
SQw1[1000] 5.47 2.96 5.10 1.10 
SQw2/1000] 3.75 1.84 1.88 0.91 
SOy1f1000] 1.14 1.07 | ea be 0.02 
S@Qdoorf 1000] 0.28 0.28 0.28 0.00 
SQcei[ 1000] 93.11 72.09 91.73 6.67 
SOwin[ 1000] 1.62 0.95 1.29 0.26 
SOw1/2000] 7.03 3.80 6.55 1.41 
SQw2/2000] 3.75 1.84 1.88 0.91 
SO1[2000] Oe) 2.13 2.24 0.03 
SQdoor[2000] 0.36 0.36 0.36 0.00 
SQceif2000] 86.30 77.64 85.01 2.69 
SOwin[2000] ().94 0.55 0.76 0.15 
x/L 0.515 0.178 0.218 0.079 
y/W 0.513 0.415 0.478 0.024 
SH 0.555 0.408 0.454 0.036 


* V, room volume; L, length; W, width; H, height; Sows, equivalent absorption coefficient of walll area; 
SQw2, equivalent absorption coefficient of wall2 area; Sa, equivalent absorption coefficient of floor area; 
SQdoor, equivalent absorption coefficient of door area; Sdcei, equivalent absorption coefficient of ceiling area; 
SQwin, equivalent absorption coefficient of window area; and x/L, y/W, z/H are the sound source position. 


Table 2-4 show the results comparison between the actual physical measurement, ODEON 12.10 
simulation, and the proposed FFNN system for frequencies of 500Hz, 1000Hz and 2000Hz, respectively. 
As the initial FFNN trainings were done per frequency, the validation tables were separated in order to 
observe each FFNN system efficiency. 

From the results obtained, it can be observed that the error between the proposed FFNN system and 
the actual physical measurement are all within the accepted range of +0.1s with the average accuracy 
percentage of 94.35%, 95.91%, and 96.42% for 500Hz, 1000Hz, and 2000Hz, respectively. The average 
percentage accuracy between the proposed system and the actual physical measurement is 1.18% lower for 
the 500Hz frequency, 2.2% higher for the 1000Hz frequency, and 1.1% higher for the 2000Hz frequency than 
the percentage accuracy between the ODEON simulation and the actual physical measurement. This shows 
that the proposed system managed to produces up to par results as the ODEON ray-tracing simulation for 
classroom RT estimation. 


Table 2. Validation for 500Hz Frequency 


ee eer ae Proposed % accuracy % accuracy 

Class room (s) ODEON (s) Neural ODEON vs proposed vs 

network (s) measurement measurement 
CRO1 0.74 0.71 0.73 95.95 98.65 
CRO2 0.75 0.76 0.79 98.67 94.67 
CRO3 0.74 0.81 0.79 90.54 93.24 
CRO04 0.72 0.77 0.80 93.06 88.89 
CRO5 0.77 0.71 0.81 92.21 94.81 
CRO06 0.72 0.72 0.81 100.00 87.50 
CRO7 0.80 0.74 0.81 92.50 98.75 
CRO8 0.81 0.76 0.78 93.83 96.30 
CRO9 0.83 0.75 0.78 90.36 93.98 
CR10 0.80 0.78 0.79 97.50 98.75 
CRI1 0.73 0.72 0.78 98.63 93.15 
CR12 0.74 0.72 0.81 97.30 90.54 
CR13 0.80 0.79 0.85 98.75 93.75 
CR14 0.81 0.78 0.83 96.30 97.53 
CRI15 0.76 0.74 0.80 97.37 94.74 
Average 99:55 94.35 
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Table 3. Validation for 1OO0Hz Frequency 


ee nearer Proposed % accuracy % accuracy 

Class room (s) ODEON (s) Neural ODEON vs proposed vs 

network (s) measurement measurement 
CRO1 0.85 0.88 0.88 96.47 96.47 
CRO02 0.85 0.92 0.88 91.76 96.47 
CRO3 0.87 0.97 0.87 88.51 100.00 
CR04 0.86 0.95 0.88 89.53 97.67 
CRO5 0.85 0.89 0.90 95.29 94.12 
CRO06 0.81 0.90 0.90 88.89 88.89 
CRO7 0.87 0.85 0.89 97.70 97.70 
CRO8 0.90 0.94 0.87 95.56 96.67 
CRO9 0.89 0.85 0.88 95.51 98.88 
CR10 0.88 0.83 0.87 94.32 98.86 
CRI1 0.95 0.89 0.87 93.68 91.58 
CR12 0.82 0.88 0.88 92.68 92.68 
CR13 0.92 0.99 0.96 92.39 95.65 
CR14 0.92 0.97 0.94 94.57 97.83 
CR15 0.84 0.83 0.88 98.81 95.24 
Average 93.71 95.91 


Table 4. Validation for 2000Hz Frequency 


TE cere eer Proposed % accuracy % accuracy 

Class room (s) ODEON (s) Neural ODEON vs proposed vs 

network (s) measurement measurement 
CRO1 0.91 0.89 0.96 97.80 94.51 
CRO2 0.88 0.93 0.90 94.32 O7.13 
CRO3 0.92 0.97 0.89 94.57 96.74 
CRO04 0.89 0.94 0.87 94.38 9145 
CRO5 0.85 0.90 0.89 94.12 95.29 
CRO06 0.84 0.89 0.89 94.05 94.05 
CRO7 0.93 0.85 0.88 91.40 94.62 
CRO8 0.89 0.87 0.88 97.75 98.88 
CRO9 0.93 0.88 0.89 94.62 95.70 
CR10 0.93 0.83 0.87 89.25 93.55 
CRI1 0.91 0.91 0.91 100.00 100.00 
CR12 0.84 0.88 0.88 95.24 95.24 
CR13 0.92 0.91 0.91 98.91 98.91 
CR14 0.93 0.89 0.89 95.70 95.70 
CR15 0.86 0.84 0.88 97.67 97.67 
Average 95.32 96.42 


5. CONCLUSION 

An RT estimation system was built using feed forward neural network and the data for the FFNN 
training was computed using ODEON 12.10 ray-tracing method. The built system shows positive results with 
average validation accuracy of 94.35%, 95.91%, and 96.42% for SOOHz, 1000Hz, and 2000Hz respectively 
compared to the actual measurement using reverberation room method. From the results gathered, the built 
system has shown a huge potential for commercialization although the system works are still wide open for 
exploration and improvement, such as applying the adaptive filter to eliminate the source of speaker 
identification noise [19]. 
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